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GEMINATION AND CONTROL OF 



BIMOLECULAR INTERACTIONS 



FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to the determination and control of 
5 bimolecular interactions and, more particularly, to the exploitation of these 
interactions for the production of new pharmaceuticals, such as vaccines, and 
diagnostic or research assays, such as antibody-based assays. 

Bimolecular interactions are important for a variety of biological 
processes, including pathological processes. Such interactions typically involve 

10 the recognition of a three-dimensional structure, such as a protein, carbohydrate 
or drug ligand, by another such structure. Nature performs with ease many 
such interactions, which so far have proven largely refractory to analysis. Such 
difficulty has had a negative impact on the fields of vaccine and drug 
development in particular, which have had to rely on a trial-and-error 

15 approach, in the absence of defined rules for the production of novel vaccines 
and other pharmaceuticals. However, such trial-and-error approaches are 
costly and inefficient. Clearly, new approaches are needed in these fields. 

The problem of vaccine and drug development, which is associated with 
bimolecular interactions, can be narrowed to the interaction between specific 

20 epitopes on the two molecules involved. In the case of two proteins, these 
epitopes can be composed of particular peptides, or of peptides and 
carbohydrates. For a drug and its receptor, these epitopes may consist of 
peptides on the receptor, and functional groups on the drug. These different 
materials would appear to indicate that these different types of bimolecular 

25 interactions would require different systems for study. However, as described 
below, all of these different epitopes can be mimicked by peptides. Thus, a 
single system for screening large numbers of peptides could, be employed to 
explore all of these different types of epitope interactions, since all of these 
interactions could be represented by different types of peptide libraries. For 

30 example, peptides which mimic a carbohydrate could be found using a random 



wo 98/20169 



PCT/IL97/00353 



2 

peptide library, which contains all possible peptides of a given length. 
Alternatively, an antigen library could be used to represent peptides derived 
from the primary sequence of an antigen, such as a protein, for example. If 
such an antigen library represented all possible peptides of a given length 
5 contained within the protein, the library could be said to represent a complete 
pepscan of the antigen. 

Such a complete pepscan could be found in a reference by Baughn et al. 
[Baughn, R,E., Demecs, M,, Taber, L.H. and D.M. Musher, Infection and 
Immunity, 1996, 64:2457-2466] for the 15-kDa lipoprotein of Treponema 

10 pallidum , which causes syphilis. Overlapping decapeptides (ten amino acids) 
were synthesized, each of which overlapped the next by nine amino acids, and 
were offset by one amino acid, so that a complete set of decapeptides was 
obtained. These were then screened with sera from syphilitic rabbits in an 
ELISA (enzyme-linked immunosorbent assay), to find those peptides which 

15 reacted with antibodies against syphilis. The limitations to such an approach 
are immediately obvious, particularly since the synthesis of such a large 
number of peptides is both tedious and difficult to manage. Clearly, producing 
complete pepscans by peptide synthesis limits the approach for small proteins. 
Indeed, Baughn et al. note that their choice of protein was strongly influenced 

20 by size. 

Thus, a new approach to the exploration of bimolecular interactions, and 
by extension to the fields of vaccine and drug development, is required. This 
approach uses combinatorial phage display peptide libraries to quickly sort 
through a huge number of peptides to find those peptides of interest, by a 

25 screening assay which functionally selects for a particular behavior in a peptide, 
as described by G.P. Smith and J. K. Scott [Scott, J.K. and G.P. Smith, 
Science, 1990, 249:386-390 and Smith, G.P. and J.K. Scott, Methods in 
Enzymol., 1993, 217:228-257]. For example, to find peptides which bind a 
particular protein, a phage display peptide library can be affinity-purified using 

30 that protein, and then reinfected into bacteria to make more phage containing 
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those peptides of interest. Thus, two problems are solved simultaneously. 
First, a huge number of peptides can be screened in a single assay. Second, 
those peptides of interest can be enriched simply by infecting bacteria with the 
phage containing those peptides, and using the biological machinery of the 
5 bacteria to make more phages of interest. Thus, combinatorial phage display 
peptide libraries can do quickly and easily what artificial laboratory techniques 
cannot. 

Such phage display peptide libraries are typically constructed in the 
following manner. Phages consist of DNA surrounded by coat proteins, which 

10 enable the phage to infect host bacteria and replicate themselves, producing 
many copies of the phage. To exploit this property, DNA sequences coding for 
the peptide of interest are inserted into the gene coding for a phage coat 
protein. As long as these insertions do not interfere with the life cycle of the 
phage, these modified phages will have coat proteins which display the foreign 

15 peptide. Filamentous phages are the preferred vectors, because two of their 
coat proteins can be easily modified to display foreign peptides, and thus 
foreign epitopes, on their surface. In general, these modifications are well 
tolerated. However, even if the modifications are not tolerated, the phage can 
still be rescued by a variety of techniques, including co-infection with a wild- 

20 type phage, known in the art as a helper phage. 

The two coat proteins of the filamentous phage of types such as Ml 3, 
fd and f2 are known as pIII and pVIIL There are only five copies of pIII on 
the phage coat, while there are about 2700 copies of pVIII on the coat. 
However, pIII can generally tolerate large insertions of up to a few hundred 

25 amino acids in length, while pVIII can tolerate only five or six amino acid 
insertions. As noted above, other techniques can be used to rescue phage with 
pVIII proteins containing larger insertions. 

There are two divergent methods for selecting the group of peptides 
which are to be inserted into the phages to form the phage library. The first 

30 type of peptide group is selected according to a known DNA or protein 
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sequence, and forms a series of overlapping peptides. These protein-derived 
peptides can be used to represent a protein epitope or an entire protein. The 
second type of peptide group is a partial or complete set of random 
oligonucleotides. The first group is clearly most useful for a defined problem; 
5 for example, the mapping of a particular epitope. The second group is clearly 
useful when an amino acid sequence for an epitope is unknown, discontinuous 
in the primary amino acid sequence, or as in the case for complex carbohydrate 
epitopes, non-existent. In the last case, peptides, called mimo topes, have been 
found which mimic a selected carbohydrate epitope. 

10 The use of each of these groups of peptides can be most easily 

demonstrated with reference to the field of vaccine development. Vaccinology 
is based upon the discovery of epitopes within the pathogen of interest which 
can be used to elicit an immune response which can neutralize that pathogen. 
Once these epitopes have been found, they can be presented to the immune 

15 system as an active vaccine, to prime the immune system against future 
infection, in most cases, without causing any infection or pathology themselves. 
Alternatively, antibodies which bind these epitopes can be isolated and 
administered as a passive vaccine. Thus, vaccinology depends upon the 
screening of large numbers of epitopes, in the hope of finding such 

20 "neutralizing epitopes", and as such is clearly amenable to the phage display 
library approach. 

One example is the screening of random peptide phage libraries with 
purified antibodies or sera from humans or animals which have been challenged 
with a particular pathogen or with an antigen of that pathogen. For example, 

25 sera from human patients immunized against a hepatitis B viral antigen, an 
envelope protein from the virus, were used to screen a random library of 
nonapeptides (peptides of nine amino acids) inserted into the coat protein pVIIL 
Phages were selected which contained a nonapeptide that was both an antigenic 
and an immunogenic mimic of the actual viral antigen [Folgori, A. et al. , 

30 EMBO, 1994, 13:2236-2243]. Such an approach has also been used in diseases 



wo 98/20169 



PCT/IL97/00353 



5 

where specific antigens are not known, in an effort to map those antigenic 
epitopes which react with antibodies in sera which have been raised against the 
pathogen itself. 

The use of protein or antigen derived phage Hbraries represents a more 
5 pathogen-specific approach. The advantage of the antigen derived approach is 
that the peptides presented by the phage are all related to the pathogen of 
interest, unlike the random peptide approach, in which many of the peptides 
will not represent any portion of the pathogen. Thus, a higher proportion of 
the phages will contain peptides with potentially useful information. 

10 An example of the use of antigen derived phage libraries to map an 

antigen is given by Wang et aL [Wang et al. , J. Immun. Methods, 1995, 
178 : 1-12] for the bluetongue virus outer capsid protein VPS . Bluetongue virus 
infects sheep and cattle. VP5 is a known antigen for this virus and is 526 
amino acids in length. The VP5 gene was partially digested using DNAase I, 

15 an enzyme which cuts DNA relatively randomly. The resulting DNA 
fragments were sorted by size, and those of about 100-200 bp were inserted 
into the phage pIII gene, and expressed in a phage display library. This library 
was screened with a monoclonal antibody to find those peptides of the VPS 
protein which bind to that antibody and two different peptides were found. As 

20 estimated by the authors, this library contained about 200 different peptides, 
including those peptides representing the vector itself. Thus, only about 70 
peptides represented the actual antigen. However, in order for every possible 
peptide of 30-70 amino acids contained within the VP5 protein to be 
represented, at least 450 different peptides would be required. Thus, this 

25 library did not even completely represent a single known antigen with 
overlapping peptides, A much more extensive library would be needed to 
represent all overlapping peptides of a given length within the antigen, thus 
generating a complete pepscan. 

Phage libraries do not need to be used simply for mapping epitopes, 

30 however. Perham and colleagues [Greenwood, J., Willis; A.E. and R.N. 



wo 98/20169 



PCT/IL97/00353 



6 

Perham, J. Mol. Biol., 1991, 220:821-827; and Willis, A.E., Perham, R.N. 
and D. Wraith, Gene, 1993, 128:79-83] have suggested that peptide epitopes 
displayed by phage can act as antigens. Discrete peptides obtained from the 
major surface protein of the malaria parasite Plasmodium falcipamm were 
5 injected into mice, and were successfully immunogenic, causing specific 
antibodies to be raised against these discrete peptides. However, the peptides 
used were few in number, and no suggestion was made that an entire phage 
library could be used as a vaccine. 

An alternative to the use of phages to display peptides which correspond 

1 0 to inserted DNA fragments is to use the DNA fragments themselves in a DNA 
vaccine. DNA vaccines are, as their name suggests, composed of DNA which 
can stimulate antigen-specific immunity within an animal. The DNA in 
question must code for the antigen of interest. Such DNA vaccines include the 
expression library immunization system (ELI) or naked DNA vaccines. As 

15 noted in Ulmer et aL [Ulmer et al. , Curr. Op, Immunol., 1996, 8:531-536], 
naked DNA vaccines have been shown to be effective against influenza vims 
in animals, A particular bonus of these naked DNA vaccines is that they can 
elicit cellular as well as antibody responses, which many conventional vaccines 
cannot. 

20 The expression library immunization system (ELI) also uses naked DNA 

coding for those proteins expressed by a particular pathogen, as described by 
Barry et al. [Barry et aL , Nature, 1995, 377:632-635]. Alternatively, proteins 
expressed in bacteria have been used to present recombinant proteins to the 
immune system, as described by Mougneau et al. [Mougneau et al. . Science, 

25 1995, 268:563-566]. However, both of these methods are somewhat limited 
in power as compared to complete pepscans, since complete pepscans can cover 
substantially every possible continuous epitope of a pathogen, while these other 
methods only present specific proteins from a pathogen. Furthermore, as noted 
by Barry et aL , large amounts of naked DNA were required for immunization. 
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Phage can also be used to present peptides for non-vaccine related 
interactions. Random peptide phage libraries have been used to target organs 
in vivo by Pasquaiini and Ruoslahti [Pasqualini, R. and E. Ruoslahti, Nature, 
1996, 380:364-366]. In their experiments, an entire random peptide phage 
5 library was injected into mice, and the mice were sacrificed 1-4 minutes later. 
Thus, although phage carrying specific peptides successfully targeted particular 
organs, the time frame did not permit any immunogenic effect to be observed, 
nor was such a potential effect even mentioned or intended by the authors. 

Random peptide phage libraries have an additional advantage over more 

10 specific complete pepscans of an antigen. Random phage libraries can be used 
to map discontinuous epitopes, while a complete pepscan can be used to map 
continuous epitopes of an antigen, because of the nature of the group of 
peptides represented. A continuous epitope can be defined as one in which the 
antigenic residues reside within a short sequence of amino acids, less than from 

15 about ten to about fifteen amino acids. This sequence should definitely not be 
any longer than the length of the average peptide. Thus, only those epitopes 
which are continuous will be mapped. However, many epitopes have been 
shown to be discontinuous. These epitopes are composed of peptides derived 
from different positions in the primary sequence, but which are adjacent in the 

20 three-dimensional structure of the protein. An antigen-derived peptide display 
library of relatively short peptides does not contain peptides which represent 
these epitopes. 

An example of the importance of discontinuous epitopes is in the study 
of HIV, or human immunodeficiency virus. HIV causes almost invariably fatal 
25 disease in humans; so far, no cure or vaccine has been found. An important 
step in HIV infection: is the binding of an HIV envelope protein, gpl20, to the 
T-cell receptor CD4. CD4 may also be important in post-binding events in 
HIV infection. Indeed, it has been suggested that CD4 changes conformation 
in response to HIV binding, and that this altered conformation may also be 
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responsible for post-binding infection events. Thus, the interaction of gpl20 
with CD4 is a natural target for vaccine design. 

Part of the difficulty in finding such a vaccine, however, is that the 
major neutralizing epitope, the V3 region of gpl20, is hypervariable, so that 
5 antibodies raised against this region tend to be specific for one type of HIV and 
not others. Antibodies have been found with much broader specificity, several 
of which clearly bind to discontinuous gpl20 epitopes. Thus, such 
discontinuous epitopes are obvious targets for mapping, as an aid to vaccine 
design. 

10 As noted above, random peptide phage libraries have been used to map 

discontinuous epitopes in a variety of systems. Cortese et al. [Cortese et al. , 
Tibtech, 1994, 6:73-80] review a number of discontinuous epitopes which have 
been found using random peptide phage Ubraries. However, screening such 
libraries with sera has not always produced significant results, probably because 

15 of the low or incomplete representation of all discontinuous epitopes. In order 
to overcome this problem, a refinement of the random peptide phage library 
approach uses constrained peptides, in which amino acids inserted around the 
random peptide define a particular structure for the peptide to assume, for 
example a loop structure. However, this approach forces specific structures to 

20 be selected, if all random peptides are to be screened. Alternatively, the 
sequence of the peptide can be held constant, and those surrounding amino 
acids which determine the structure of the peptide can be varied. In either 
case, a great deal of the power of random peptide phage libraries, namely the 
ability to search a broad group of epitopes, is reduced. Thus, more refined 

25 discontinuous epitope mapping approaches are clearly needed, which combine 
the power of random peptide phage libraries with the specificity of antigen- 
derived phage libraries. 

Carbohydrate-protein interactions have also been studied using random 
peptide phage libraries because of the ability of such libraries to potentially 

30 represent discontinuous epitopes of the carbohydrates themselves. These 
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interactions are important for a number of biological processes, including 
lymphocyte migration and binding of the hemagglutinin protein of the human 
influenza virus to erythrocyte glycoproteins, an important step in infection by 
the virus. However, these interactions have typically been difficult to study, 
5 because of the difficulty in synthesizing complex carbohydrate ligands. To 
solve this problem, peptides can be found which mimic carbohydrate ligands. 
These peptides are also called "mimotopes" because of their mimicry of the 
carbohydrate epitopes. For example, Oldenburg et ah [Oldenburg et al. , 
PNAS, 1992, 89:5393-5397] used a random octapeptide (eight amino acid) 

10 phage library and screened these phage for the ability of the peptide to bind the 
carbohydrate-binding protein concanavalin A. A group of peptides were found 
which bound to the protein, although many of these peptides had no obvious 
sequence homology. 

The interactions of many different molecules with proteins, including 

15 carbohydrates and dmgs, could be much more easily elucidated if the three- 
dimensional stracture, or tertiary native conformation, of the protein were 
known. Currently, these structures have generally been determined by using 
X-ray crystallography. However, as its name suggests, this method requires 
the protein to be capable of fomiing usable crystals, which are non-trivial to 

20 prepare. Indeed, many proteins do not form satisfactory crystals at all, 
including the vast majority of membrane-sparming proteins, such as 
neurotransmitter receptors. To overcome this barrier, a number of attempts 
have been made to use algorithms to predict the three-dimensional structure of 
a protein from its primary amino acid sequence, as described in Protein 

25 Folding , ed. by N. Jaenicke, p. 167-181, Amsterdam, Holland (1980), or 
Computer- Assisted Modeling of Receptor-Ligand Interactions, Theoretical 
Aspects and Applications to Drug Design, ed. by R. Rein and A. Golombek, 
1989, Alan R. Liss, New York for example. Commercially available 
algorithms include those from MSI, United Kingdom, including Quanta, Delphi 

30 and Charmm. However, these algorithms have generally failed to adequately 
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predict the three-dimensional structure of the protein, simply because there are 
many theoretical structures or conformers to examine, and the rules of protein 
folding are not completely known. 

A compromise between these two approaches has been the use of 
5 laboratory experiments to obtain information about the protein itself, which can 
then be used to place constraints on such protein structure determinations. 
Such information can be obtained by NMR (Nuclear Magnetic "Resonance), 
which provides information about the interactions of atoms within the protein 
in the form of distance constraints, although the distances between atoms must 

10 be relatively short (less than about 5 A). However, NMR suffers from lack of 
specificity; that is, the interactions of too many atoms are all presented 
simultaneously, making it difficult to decipher the behavior of individual atoms. 
Furthermore, NMR requires vast amounts of highly purified protein and is also 
only suitable for water soluble protein. 

15 Alternatively, electron diffraction techniques can be used for a small 

number of proteins, specifically those membrane-spanning proteins which are 
highly concentrated within the membrane, such as bacteriorhodopsin. 

One method which might be more generally applicable was described in 
a study by E. Haas [E. Haas, Computer- Assisted Modeling of Receptor-Ligand 

20 Interactions, Theoretical Aspects and Applications to Drug Design, ed. by R. 
Rein and A. Golombek, 1989, Alan R. Liss, New York, p. 157-170]. This 
method uses the interaction of fluorescent dye molecules attached to particular 
residues within the protein to obtain information about the strucmre of the 
protein. However, this method requires that the protein be purified and 

25 labelled with dye molecules, which are both non-trivial procedures. 

- Once these constraints have been obtained, algorithms are available 
which use this information to predict the three-dimensional structure of a 
protein, or at the very least to eliminate those theoretical structures which are 
not compatible with the experimental evidence. Obviously, as the number of 

30 constraints is increased, the predictive ability of these algorithms will be 
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improved correspondingly. Furthermore, it has been noted that longer range 
distance constraints, or constraints between pairs of residues which are 
relatively further apart along the primary amino acid sequence, are more useful 
dian short range distance constraints, such as those calculated by NMR [Wako, 
5 H, and H. A. Scheraga, Macromolecules, 1981, 14:961-969]. 

Clearly, such algorithms could be improved by finding constraints which 
both more accurately reflect partial structures of the protein, and which are 
more easily measured in a laboratory. 

There is thus a widely recognized need for, and it would be highly 

10 advantageous to have, a system for the discovery of discontinuous epitopes, to 
be used as vaccines, for drug design, for diagnostic purposes and for the 
elucidation of three-dimensional protein structure. Specifically, it would be 
advantageous to have a system to map discontinuous epitopes which is both 
complete, yet more specific than random phage libraries. It would also be 

15 advantageous to develop DN A vaccines which exploit the concept of 
overlapping peptides and/or discontinuous epitopes in a variety of expression 
systems. Finally, it would be advantageous to use discontinuous epitopes for 
preparing antibodies, as components of diagnostic tools, for preparing passive 
vaccines and for elucidating three-dimensional protein structure. 

20 SUMMARY OF THE INVENTION 

According to the present invention there is provided a phage display 
library, including: (a) a plurality of fragments of a substantially complete digest 
of substantially the entire genome of an organism; and (b) a plurality of phages, 
each of the phages containing one of the fragments. 

25 In the following methods, most of the examples will use DNA, rather 

than RNA, as the genetic material. It will be appreciated, however, that many 
of these methods could also be used for RNA, so that the use of the term 
"DNA" is not intended to be limiting in these examples. The term "genome" 
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is hereinafter defined as the complete genetic material of an organism, whether 
that genetic material is DNA or RNA based. 

Preferably, the organism is a pathogen selected from the group 
consisting of a virus, a bacterium, a yeast and a parasite. 
5 According to further features in preferred embodiments of the invention 

described below, there is provided a method of preparing a vaccine, including 
the steps of: (a) preparing a complete pepscan of at least one polypeptide of an 
organism; and (b) providing a vaccine carrier for the complete pepscan. The 
vaccine carrier can optionally include a pharmaceutical ly appropriate buffer, 

10 Also optionally, the complete pepscan is produced by synthesizing peptides. 
Alternatively, the complete pepscan is produced by a plurality of bacteria, the 
peptides are syntliesized by the bacteria. Optionally, the vaccine carrier 
includes a plurality of phages, the peptides are presented by the phages. 
Preferably, the phages are filamentous phages and each of the peptides is 

15 presented by a coat protein of the filamentous phages. Most preferably, the 
coat protein is pVIII or pIII. Also alternatively, the vaccine carrier includes 
an eukaryotic expression vector and the complete pepscan is presented by the 
vector to be preferably used as a DNA vaccine in which the peptides are 
ultimately synthesized by the organism to be vaccinated, 

20 According to still further features in preferred embodiments of the 

invention described below, there is provided a method of vaccinating an 
organism, including the steps of: (a) preparing a vaccine by the above method; 
and (b) administering the vaccine to the organism. 

According to another embodiment, there is provided a method of 

25 preparing a discontinuous library of an organism having a genome, including 
the steps of: (a) at least partially digesting at least a portion of the genome of 
the organism to form a plurality of fragments, the portion being characterized 
as representing at least a part of a single biological unit; and 
(b) ligating the fragments to form at least one ligated fragment; and (c) at least 

30 partially digesting the ligated fragment to form at least one conformational 
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fragment. Preferably, the biological unit is a polypeptide. Also preferably, the 
method further includes (d) providing a display carrier for the at least one 
conformational fragment. More preferably, the display carrier includes at least 
one bacteria and the at least one conformational fragment is inserted into 
5 genetic material within the at least one bacteria. Alternatively, the display 
carrier includes at least one phage and the at least one conformational fragment 
is inserted into genetic material within the phage. Preferably, the phage is a 
filamentous phage, and the at least one conformational fragment is inserted into 
a gene for a coat protein of the filamentous phage. Most preferably, tlie coat 

10 protein is selected from the group consisting of pIII and pVIII. Alternatively, 
the display carrier includes an eukaryotic expression vector and the at least one 
conformational fragment is inserted into the vector. Also provided is a 
discontinuous library of the genome of an organism, including at least one 
conformational fragment, prepared according to the above method. 

15 According to still further features in the described preferred 

embodiments, there is provided a method of preparing a conformational 
peptide, including the steps of: (a) preparing a discontinuous library of an 
organism according to the above method; (b) inserting the discontinuous library 
into an expression system; and (c) obtaining the conformational peptide from 

20 the expression system. Preferably, the expression system includes at least one 
bacteria, the discontinuous library is inserted into genetic material of the at 
least one bacteria. Most preferably, the conformational peptide is obtained 
from the expression system by isolating the conformational peptide, such that 
the conformational peptide is at least a partially purified conformational 

25 peptide. Alternatively, the expression system includes at least one phage and 
the discontinuous library is inserted into genetic material of the at least one 
phage. Also provided is a conformational peptide, including a peptide, the 
sequence of the peptide being determined by a digestion product of a ligation 
product of at least two fragments of at least a partial digest of at least a portion 

30 of the genome of an organism, the portion representing at least a part of a 



• 
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single biological unit. Optionally, the peptide is obtained from an expression 
system, the expression system including the digestion product. 

According to yet another embodiment, there is provided a method of 
preparing a vaccine, including: (a) preparing a discontinuous library according 
5 to the above method; and (b) providing a vaccine carrier for the discontinuous 
library. Optionally, the vaccine can then be administered to an organism to be 
vaccinated. 

According to yet another embodiment, there is provided a method of 
detecting an antibody for binding at least one discontinuous epitope of a single 

10 biological unit of a first organism, including the steps of; (a) preparing a 
screening library of the first organism; and (b) detecting the antibody for 
binding the discontinuous epitope by screening the screening library with 
immune material from a second organism. Optionally, the screening library 
includes a discontinuous library prepared according to the above method, and 

15 the immune material is prepared by administering at least a portion of the 
single biological unit of the first organism to the second organism. 
Alternatively, the screening library includes at least a portion of the single 
biological unit of the first organism and the immune material is prepared by 
administering a discontinuous library prepared according to the above method 

20 to the second organism. This antibody can form part of a passive vaccine, in 
another embodiment, and the vaccine can be administered to an organism to be 
vaccinated. 

According to yet another embodiment, there is provided a diagnostic tool 
for detecting an organism, including: (a) an antibody for binding at least one 

25 discontinuous epitope of the organism, the antibody being prepared by 
screening a discontinuous library with immune material to detect the antibody, 
the discontinuous library including at least one digestion product of at least one 
ligation product of digestion fragments of at least a portion of the genome of 
the organism, the portion representing at least a part of a single biological unit; 

30 and (b) a detection assay for determining when the antibody is bound to the at 
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least one discontinuous epitope of the organism. Preferably, the detection assay 
employs a detection moiety attached to the antibody. Alternatively, the 
detection assay employs a gradient, and a location of the antibody within the 
gradient is dictated by the antibody binding to the at least one discontinuous 
5 epitope of the organism. Also alternatively, the detection assay employs a 
chromatograph, and a location of the antibody within the chromatograph is 
dictated by the antibody binding to the at least one discontinuous epitope of the 
organism. 

According to still further embodiments, there is provided a method for 

10 determining a stmcture of a protein having an identified gene, including the 
steps of: (a) preparing a conformational peptide of the protein from the gene 
according to the above method; (b) screening the conformational peptide with 
a molecule, the molecule being characterized by having an interaction with the 
protein; (c) determining a sequence of the conformational peptide; and (d) 

15 deducing the structure of the protein from the sequence. Preferably, the 
molecule is an antibody, the antibody is for binding to at least one 
discontinuous epitope of the protein. Alternatively, the molecule is a ligand, 
the ligand is for binding to the protein. Also alternatively, the molecule is a 
mimotope, the mimotope is for binding a mimotope binding site of the protein. 

20 According to other embodiments of the present invention, there is 

provided a filter for determining if a theoretical structure of a protein is non- 
biological, including: (a) a dipeptide juxtaposition of the protein, the dipeptide 
juxtaposition being determined by a sequence of a conformational peptide of the 
protein, the sequence being determined by a digestion product of a ligation 

25 product of at least two fragments of at least a partial digest of at least a portion 
of the genome of an organism, the portion being characterized as representing 
at least a part of a single biological unit; and (b) an algorithm for comparing 
the dipeptide juxtaposition to the theoretical structure and for determining if the 
theoretical structure is non-biological. 
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of well known prior art experimental protocols. A further example is Cell 
Biology ed. by J.E. Celis, Academic Press Inc., 1994. 

Example 1 

Methods of Constructing a Continuous Complete Pepscan Library 
5 As described in the Background section above, a pepscan includes 

peptide fragments of a protein or protein(s) of interest. In a complete pepscan, 
these peptides overlap such that each peptide is offset from the next by one 
amino acid residue. A pepscan does not need to be limited to a single protein 
or even a group of proteins. Instead, a pepscan can also be constructed from 

10 the entire genome of an organism. Such a pepscan can be described as a 
continuous pepscan, because it is prepared from peptide fragments which 
represent continuous epitopes of the primary amino acid sequence of the protein 
or proteins. A number of methods can be employed to construct such a 
continuous complete pepscan. 

15 One method involves synthesizing peptides, such that each peptide is 

offset from the next by one residue. These peptides can be synthesized in a 
variety of ways including, but not limited to, artificial production by a peptide 
synthesizing machine. 



20 substantially complete digest of the genome of the organism of interest is 
prepared, by digesting the genome so that it is cut into fragments as described 
hereinbelow. Alternatively, a digest of a portion of the genome of the 
organism of interest can be prepared, such as a portion of the genome coding 
for a polypeptide, for example. For example, a complete pepscan of a peptide 

25 toxin, such as ricin, venoms including, but not limited to, bungarotoxin or 
bacterial toxins, might be desirable. Thus, although the following description 
will refer to "the genome of the organism", it should be understood that a 
portion of the genome of the organism could also be used. To prepare the 
digest, typically, DNA is digested by an enzyme which cleaves the DNA 



Another method uses the genome of the organism of interest. First, a 
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Finally, in yet another embodiment, there is provided a method of 
obtaining an antibody for binding at least one discontinuous epitope of a single 
biological unit of a first organism, including the steps of: (a) preparing a 
vaccination library of the first organism; (b) administering the vaccination 
5 Hbrary to a second organism for producing the antibody; and (c) detecting the 
antibody for binding at least one discontinuous epitope of the single biological 
unit of the first organism, according to the above method. Preferably, the 
vaccination library includes at a least a portion of the single biological unit of 
the first organism and the screening library is a discontinuous library prepared 
10 according to the above method. Optionally, the screening library includes at 
least a portion of the single biological unit of- the first organism and the 
vaccination library is a discontinuous library prepared according to the above 
method. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
15 The present invention is of both methods and compositions which can be 

used to analyze and control bimolecular interactions. Specifically, the present 
invention includes libraries for both continuous and discontinuous pepscans, 
which are the basis for a number of products and methods, including both 
passive and active vaccines and tools for diagnosis and structural analysis. 
20 The invention is illustrated by the following examples, which describe 

the construction of both continuous and discontinuous pepscan libraries, as well 
as methods and compositions which use these libraries. 

Please note that many experimental protocols, which are well known in 
the art, are used in the following descriptions but are not described in detail. 
25 These prior art protocols are described in a number of protocol and text books. 
One example of such a book is Molecular Cloning by J. Sambrook et al. , Cold 
Spring Harbor Laboratory Press, 1989, which is incorporated by reference as 
if fully set forth herein for the sole purpose of providing complete descriptions 
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strand. For a substantially complete digest of the genome, the enzyme DNAase 
I is often used. DNAase I cuts DNA relatively randomly; that is, it cuts 
between any two bases with relatively equal efficiency. When DNAase I is 
incubated with DNA under appropriate conditions, a collection of fragments of 
5 different sizes is obtained which initiate at substantially every single possible 
base pair. Thus, the genome is represented by DNA fragments which are 
offset by one base pair. Appropriate conditions are chosen so that a significant 
fraction of the fragments fall within a desired size range. This collection of 
fragments is a substantially complete digest of the genome of the organism. 

10 Optionally, such a digest could be prepared by mechanical shearing of 

the DNA. Such mechanical shearing is well known in the art, and can be 
accomplished by sonication of the DNA, for example. Mechanical shearing has 
the further advantage of producing fragments of a more homogeneous size, 
since shearing forces are greatest in the middle of molecules rather than the 

15 edges, and larger molecules are more susceptible to shearing than smaller 
molecules. However, the cleavage sites are still relatively random, since the 
genome of the organism is broken relatively randomly during extraction from 
the organism. One disadvantage of sonication is that the temperature can 
become very high where the ultrasound waves enter the solution, potentially 

20 damaging the DNA. 

After the digest has been prepared, the fragments are processed so that 
all fragment ends are blunt ends, if a blunt-ended vector is used (see below). 
Those fragments which fall within the desired size range are then separated 
from the collection of fragments. For example, fragments of from about 50 bp 

25 to about 150 bp can be separated by size fractionation. Since only a portion of 
the collection of fragments from the digest fall within the desired size range, 
the rest of the fragments are not usable, so a much larger quantity of starting 
genome material is required to obtain a sufficient quantity of fragments of the 
desired size range. 
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Next, a plurality of phages is prepared to receive the group of fragments 
of the desired size range. This is accomplished by modifying a gene of the. 
phage, so that each fragment can be cloned into the phage gene. The product 
of that gene, a phage protein, will then include a foreign peptide. The 
5 sequence of the foreign peptide will depend upon the inserted DNA fragment. 
Preferably, a gene for a coat protein is used, so that the phage will display the 
foreign peptide on its outer surface. Typically, filamentous phages are used 
because they have coat proteins which are easily modified to receive the foreign 
DNA fragment. The preferred coat proteins are pIII and pVIII. pIII can 

10 tolerate insertions of foreign peptides of up to a few hundred amino acids in 
length, but only five copies of pIII are on the phage coat. About 2700 copies 
of pVIII are on the phage coat, but pVIII can only tolerate insertions of up to 
five or six amino acids, unless special rescue methods are used. The choice of 
a particular coat protein depends upon both the length of the foreign peptide 

15 and upon the desired number of copies per phage of the foreign peptide. 

Once the particular coat protein has been chosen, the gene is modified 
to contain a unique blunt end restriction site, preferably at its N terminal 
region, using methods which are well known in the art. The particular nature 
of the restriction site depends upon the strain of phage which is used. For 

20 example, for the pIII construct of the fUSE5 vector of the filamentous phage 
fd-tet such a site could be inserted between the two Sfi-1 sites present on the 
vector [Scott, J.K. and G,P. Smith, Science, 1990, 249:386-390 and Smith, 
G.P. and J.K. Scott, Methods in Enzymol., 1993, 217:228-257]. However, it 
is possible to construct many such blunt-ended phage vectors, by manipulating 

25 restriction sites so that DNA fragments can be inserted where desired. 
Alternatively, oligonucleotide linkers can be used to clone the DNA fragments 
in distinct and unique restriction sites in the vector which are not necessarily 
blunt-ended. 

Finally, the selected fragments from the digest of the genome of the 
30 organism of interest are cloned into the appropriately cut phage vector, using 
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methods which are well known in the art. These phages constitute the phage 
display library. 

A number of factors must be considered in order to ensure that the phage 
display library contains a complete pepscan of the genome of the organism. 
5 Such factors relate to the desired size range of the DNA fragment and the size 
of the genome of the organism. 

The probability of DNAase I randomly digesting the genome so that 
either end of a resulting DNA fragment is in reading frame is 333%. Each 
DNA Iragment can ligate in either one or the other orientation with equal 

10 probability. Phages containing a DNA fragment which ends out of reading 
frame are not viable as the read-through downstream to the inserted fragment 
will Ukely terminate prematurely and will definitely generate an irrelevant 
peptide, instead of the engineered pIII or pVIII proteins. Phages containing a 
DNA fragment which starts out of reading frame will also probably terminate 

15 prematurely due to the appearance of a stop codon. 

These factors, when considered collectively, determine that only about 
one of eighteen, or about five percent of all of the DNA fragments can 
functionally express relevant peptides. Furthermore, since the DNA fragments 
must be in reading frame in order to produce a functional peptide, these 

20 fragments effectively code for peptides which overlap by about one residue. 
One advantage of deriving pepscans from genetic material i£^tlia*'tfie 
sequence of the genetic material does not need to be knoyioir--^^ as DNA 
is available, it can be digested and expressedjn-rphage display library, which 
can include up to from about 10^^^Jp^^6out 10^*^ phages. The size limitations 

25 of such a library easily acpeifimodate even rather large genomes, including 
those of viruses, bpj^t^a, yeast and parasites. For example, viruses have 
genomes of from about 10^ to about 10"^ bp or more. Bacteria have genomes 
of abm^t^^ bp, yeast about 10^ bp and parasites from about 10^ bp to about 
L0^p or more. The size of the library required to accommodate a complete 
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pepscan of the genome of an organism can be calculated according^ the 
following formulas. 

, J, . , genome size^ 
number of peptides ^ ~ ^ 



111 

chances of an inserted fragment yielding a viable phage ^ * — * — 

3 3 2 



1. ^ J / genome size * 18 ^ 
number of phages^^ — ^ 6 * genome size 

only 1/18 will be viable >for an approximately single-told coverage of the 
Of the number of pjmges initially required for insertion of DNA fragments. 



5 complete pem^an. Greater coverage, such as from about five- to about ten-fold 
coverageyis preferable. In any case, a complete pepscan of even the entire 
genome of a parasite of about 10^ bp can clearly be accommodated by a single 
phage display library, since using the above formulas, only about 6"^ 10^ phages 
/ould be required. 

10 As an example, consider the production of a complete pepscan of the 

entire HIV (Human Immunodeficiency Virus) genome, which is about 9000 bp. 
About 3000 peptides of a desired size range would be required in order to have 
a peptide starting at substantially every possible residue. However, since only 
about five percent of the inserted DNA fragments can express a functionally 

15 relevant peptide, in order to fully represent 3000 peptides for about one-fold 
coverage, a library of about 60,000 phages is required, of which only about 
3000 phages will be viable. Libraries of up to from about 10^ to about 10^^ 
phages can easily be prepared. Thus, a complete pepscan of the entire genome 
of HIV can easily be accommodated within such a library. 

20 In order to prepare the phage display library for a complete pepscan of 

the HIV genome, a substantially complete digest of the HIV genome would 
need to be prepared, as described above. Since HIV is a retrovirus, the 
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complete HIV RNA would need to be reverse-transcribed into DNA. This 
DNA would then need to be substantially completely digested as this term is 
defined above and fragments of the desired size range separated. For example, 
the DNA could be incubated with DNAase I under appropriate conditions and 
5 fragments of from about 75 to about 150 bp could be separated. These 
fragments would then be cloned into a coat protein gene of an appropriately cut 
filamentous phage vector, for example the pIII gene of the fUSE5 vector, or 
a pVIII vector used in conjunction with a helper phage or other method to 
produce hybrid phages [Greenwood, J., Willis, A.E. and R,N. Perham, J, 

10 MoL Biol., 1991, 220:821-827; and Willis, A.E., Perham, R.N. and D. 
Wraith, Gene, 1993, 128:79-83]. Now the complete pepscan of the HIV 
genome is in a phage display library. 

Further examples of phage display libraries containing a complete 
pepscan of the genome of an organism can also be given. For example, the 

15 typical bacterial genome is about one million bp, or 10^ bp. About 3^10^ 
peptides of the desired size range would therefore be necessary, so that a 
library of about 6^10^ phage would be required for one-fold coverage, of 
which about five percent would be viable. Such a library could be prepared for 
Mycobacterium tuberculosis , a bacteria which was previously described in the 

20 Background section above, by using the above methods. Alternatively, libraries 
could be prepared for any bacterium and especially pathogenic bacterium. 

Other examples of organisms include yeast, which has a genome of 
_about 10^ bp. A library of about 6*10*^ phage would be required for a 
complete pepscan of the yeast genome. As noted above, the preparation of 

25 such a library would involve the substantially complete digestion of the yeast 
genome, and separating DNA fragments of the desired size. These fragments 
would then be cloned into the desired phage gene, to form the phage display 
library. 

Still other examples of organisms for which such phage display libraries 
30 can be constmcted include parasites, which typically have genomes of about 
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100 million bp, or 10^ bp, as described above. Examples of such parasites are 
those which cause leislimaniasis, including Leishmania major and L> 
braziliensis braziliensis . Another example of a parasite is Plasmodium 
falciparum , which causes malaria. A phage display library of a complete 
5 pepscan of any one of these parasites could be prepared as described above. 

Thus, a phage display library could clearly be prepared for a number of 
organisms, including, but not limited to, viruses including, but not limited to, 
retrovirus species such as HIV and HTLV, hepatitis species such as Hepatitis 
A and Hepatitis B, influenza species, human papillomavirus, herpes species 

10 such as herpes simplex, RSV (respiratory syncytial virus) and cytomegalovirus; 
bacteria including, but not limited to, Mycobacterium tuberculosis , salmonella, 
staphylococcus species such as Staph, aureus , and shigella; parasites including, 
but not limited to, Plasmodium species such as Plasmodium falciparum , 
leishmania species such as Leishmania major and L. braziliensis braziliensis , 

15 entamoeba species, giardia species, trichomonas species and trypanosoma 
species; and yeasts including, but not limited to, Candida albicans . 

Example 2 

Methods of Constructing a Bacterial Display Complete Pepscan Library 
As noted above, a complete pepscan can be displayed in a phage library. 
20 However, phages are not the only display system for a complete pepscan. The 
basic principles behind such a library can be extended to a bacterial display or 
expression library. 

The first step in preparing a bacterial display library, as in preparing a 
phage display library, is to substantially completely digest the genome of the 
25 organism of interest, or a portion of the genome of the organism of interest. 
Such a digest can be prepared according to the methods given in Example 1 
above. 

In the second step, the fragments from such a digest can be cloned into 
a vector, such as a phage, a plasmid, a phagmid or a cosmid. Examples of 
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commercially available prokaryotic vectors include pKK223~3 and pTrc99, both 
available from Pharmacia Biotech. Such vectors are well known in the art. 
Although a phage can act as a vector, the phage itself does not display the 
peptide on its outer coat. Similarly, the other vectors are used only to 
5 introduce DNA fragments into the bacteria, but are not themselves used to 
display the foreign peptides. 

Finally, the vector, with the inserted fragments from the digest, is 
transfected into the bacterial strain of choice. Again, transfection procedures 
are well known in the art. These bacteria then produce the peptide coded for 

10 by the fragment from the digest. These peptides can accumulate within the 
cell. Alternatively, they can be displayed on the cell wall of the bacteria, for 
those bacteria which display proteins or peptides on their cell wall. 
Alternatively and preferably, they can be secreted into the bacterial growth 
media, from which they can be collected. 

15 A bacterial expression library containing a complete pepscan of the 

genome of an organism could be prepared for any of the organisms noted 
above, including, but not limited to, viruses including, but not limited to, 
retrovirus species such as HIV and HTLV, hepatitis species such as Hepatitis 
A and Hepatitis influenza species, human papillomavirus, herpes species 

20 such as herpes simplex, RSV (respiratory syncytial virus) and cytomegalovirus; 
bacteria including, but not limited to, Mycobacterium tuberculosis , salmonella, 
staphylococcus species such as Staph, aureus , and shigella; parasites including, 
but not limited to, Plasmodium species such as Plasmodium falciparum , 
leishmania species such as Leishmania major and L. braziliensis braziliensis , 

25 entamoeba species, giardia species, trichomonas species and trypanosoma 
species; and yeasts including, but not limited to, Candida albicans. 
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Example 3 

Methods of Constructing an Eukaryotic Display Complete Pepscan Library 
As noted in Example 2, bacteria can be used to construct an expression 
library for a complete pepscan. However, there are some disadvantages to 
5 using bacteria or phages, particularly for eukaryotic genomes. First, it is well 
known in the art that folding of eukaryotic proteins is not always done correctly 
by bacteria, which could be problematic for a peptide of sufficient length, such 
as a peptide of greater than about twenty residues. Second, correct post- 
translational modification of eukaryotic proteins is typically only performed by 

10 eukaryotes. Thus, a eukaryotic display complete pepscan library could be used 
to solve these problems. 

Such a library would be constructed in the following way. First, a 
substantially complete digest of the genome of the organism of interest, or of 
a portion of the genome of interest, is prepared, as described in Examples 1 

15 and 2 above. Second, the fragments from the digest are cloned into a 
eukaryotic expression vector. Both such vectors and such methods are well 
known in the art. For example, commercially available vectors include pSVK 
3 and pBPV, both available from Pharmacia Biotech. Finally, the eukaryotic 
expression vector, with the inserted fragment from the digest, is transfected into 

20 an appropriate eukaryotic system. The eukaryotic cell then produces the 
peptide as part of a conjugate to an eukaryotic protein. This conjugate can 
accumulate internally, be expressed on the wall of the cell or secreted into the 
eukaryotic growth media. A particular advantage of such a conjugate is that 
unlike a bacterial protein, an eukaryotic protein may have limited or no 

25 immunogenicity, which may be important if repeated immunization is required. 

Examples of such systems include yeast and mammalian cell lines, such 
as Cos-1, which is an immortal cell line. However, such transfections do not 
need to be performed only on isolated cells. For example, the eukaryotic 
expression vector, with the inserted fragment, could be introduced into cells of 

30 a whole eukaryote, without first removing those cells from the eukaryote. For 
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example, the expression vector and inserted fragment could be introduced into 
macrophages of a horse, or of a human, such that those macrophages would 
then produce the peptide coded for by the inserted DNA fragment. 

An eukaryotic expression library containing a complete pepscan of the 
5 genome of an organism could be prepared for any of the organisms noted 
above, including, but not limited to, viruses including, but not limited to, 
retrovirus species such as HIV and HTLV, hepatitis species such as Hepatitis 
A and Hepatitis B, influenza species, human papillomavirus, herpes species 
such as herpes simplex, RSV (respiratory syncytial virus) and cytomegalovirus; 

10 bacteria including, but not limited to, Mycobacterium tuberculosis, salmonella, 
staphylococcus species such as Staph, aureus , and shigella; parasites including, 
but not limited to, Plasmodium species such as Plasmodium falciparum , 
leishmania species such as Leishmania major and L. brazihensis braziliensis , 
entamoeba species, giardia species, trichomonas species and trypanosoma 

15 species; and yeasts including, but not limited to, Candida albicans . 

Example 4 

Methods of Preparing an Active Vaccine from a Complete Pepscan Library 
and Methods of Vaccinating an Organism with the Vaccine 

As noted above in Examples 1-3, there are a number of methods of 

20 preparing a complete pepscan library of a protein or proteins, or of a complete 

genome of an organism. Once such a library has been prepared, it has a 

number of potential uses, one of w^hich is as a component of an active vaccine. 

As described in the Background section above, an active vaccine causes 

at least one epitope of a first organism to be presented to the immune system 

25 of a second organism, which is the organism to be vaccinated. It should be 

noted, hov^ever, that in the case of autoimmune reactions, treatment of 

cancerous cells, or cells exhibiting inappropriate activity for the stage in the life 

cycle of the organism, the first and second organisms are effectively the same 

organism. Thus, the "epitope of the first organism" is actually an epitope 
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expressed by a cancerous cell or a cell exhibiting inappropriate activity for the 
stage in the life cycle of the organism. Alternatively, in the case of 
autoimmune reactions such as myasthenia gravis, lupus or rheumatoid arthritis^ 
the "epitope of the first organism" is normally expressed, but the immune 
5 reaction is inappropriate. In any case, the terms "first" and "second" organism 
are used below for clarity, it being understood that the "first" and "second" 
organisms can be the same organism. 

Thus, since the complete pepscan libraries described in Examples 1-3 
cause a peptide from a complete pepscan to be displayed, and since such a 

10 peptide constitutes an epitope or a collection of contiguous epitopes of the first 
organism, clearly these complete pepscan libraries can be used for presenting 
immunogenic epitopes. An active vaccine can therefore be prepared as follows. 

First, a complete pepscan display library is prepared as described in 
Examples 1-3, The first organism, whose genome, or a portion of the genome, 

15 is used for the complete pepscan, can include, but is not limited to, viruses 
including, but not limited to, retrovirus species such as HIV and HTLV, 
hepatitis species such as Hepatitis A and Hepatitis B, influenza species, human 
papillomavirus, herpes species such as herpes simplex, RSV (respiratory 
syncytial virus) and cytomegalovims; bacteria including, but not limited to, 

20 Mycobacterium tuberculosis , salmonella, staphylococcus species such as Staph. 
aureus, and shigella; parasites including, but not limited to, Plasmodium species 
such as Plasmodium falciparum , leishmania species such as Leishmania major 
and L. braziliensis braziliensis , entamoeba species, giardia species, trichomonas 
species and trypanosoma species; and yeasts including, but not limited to, 

25 Candida albicans . 

Next, the complete pepscan display library is placed in a vaccine carrier. 
The vaccine carrier is a pharmaceutical composition, which may include 
thickeners, carriers, buffers, diluents, surface active agents, preservatives, and 
the like, all as well known in the art. Pharmaceutical compositions may also 

30 include one or more active ingredients such as but not limited to immune- 
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system stimulating agents, known in the art as adjuvants, and the Hke in 
addition to the complete pepscan library. For example, the vaccine carrier can 
include pharmaceutically appropriate buffers. Alternatively, if a coinplete 
pepscan library is prepared according to Example 1, the vaccine carrier can 
5 include a plurality of phages which present the peptides of the complete 
pepscan, preferably filamentous phages. Most preferably, the DNA fragments 
are inserted into a coat protein of the phage, optionally pIII or pVIII. In this 
case an adjuvant may not be required as the phages themselves act as immune 
system stimulants [Greenwood, J., Willis, A.E. and R.N. Perham, J. Mol. 

10 Biol., 1991, 220:821-827; and Willis, A.E., Perham, R.N. and D. Wraith, 
Gene, 1993, 128:79-83]. 

Optionally, if a complete pepscan library is prepared according to 
Example 2, the DNA fragments are inserted into genetic material of the 
bacteria, so that the complete pepscan can be produced by a plurality of 

15 bacteria which synthesize the peptides of the pepscan. Since bacteria 
themselves are both highly immunogenic, as well as potentially hazardous to 
administer directly to the organism to be immunized, substantially only the 
synthesized peptides should be included in the vaccine carrier. Preferably the 
peptides are cloned as conjugates to a secreted bacterial protein, to facilitate 

20 purification. These conjugates are then collected from the bacterial growth 
media and purified from other media constituents. Proper selection of the 
bacterial protein can eliminate the need for an adjuvant, as the bacterial protein 
itself can act as the adjuvant. 

Also optionally, if a complete pepscan library is prepared according to 

25 Example 3, the vaccine carrier can include the eukaryotic expression vector 
itself, so that the complete pepscan is presented by the vector. The method of 
Example 3 can be somewhat modified, in the following manner. Instead of 
transfecting eukaryotic cells with the eukaryotic expression vector containing 
the inserted DNA fragment, the eukaryotic expression vector itself can be 

30 placed in the vaccine carrier. The combination of the eukaryotic expression 
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vector and the vaccine carrier is an example of a "naked DNA vaccine", as 
described in the Background above. 

Once the vaccine has been prepared according to one of the above 
methods, it can be administered to an organism in a number of ways, which are 
5 well known in the art. For example, administration may be done topically 
(including ophtalmically, vaginally, rectally, intranasally), orally, by inhalation, 
or parenterally, for example by intravenous drip or intraperitoneal, 
' subcutaneous, or intramuscular injection. 

Formulations for topical administration may include but are not limited 
10 to lotions, ointments, gels, creams, suppositories, drops, liquids, sprays and 
powders. Conventional pharmaceutical carriers, aqueous, powder or oily 
bases, thickeners and the like may be necessary or desirable. 

Compositions for oral administration include powders or granules, 
suspensions or solutions in water or non-aqueous media, sachets, capsules or 
15 tablets. Thickeners, diluents, flavorings, dispersing aids, emulsifiers or binders 
may be desirable. 

Formulations for parenteral administration may include but are not 
limited to sterile aqueous solutions which may also contain buffers, diluents and 
other suitable additives. 

20 Dosing is dependent on responsiveness of the immune system of the 

organism to be vaccinated, but will normally be an initial dose of the vaccine. 
If necessary a booster dose or doses can be administered at a later date to 
achieve a desired level of protection against the first organism, as measured by 
antibody titer, for example. Persons of ordinary skill in the art can easily 

25 determine optimum dosages, dosing methodologies and repetition rates. 

Examples of organisms to which such a vaccine could be administered 
include, but are not limited to, humans, pets such as dogs and cats, farm 
animals such as horses, pigs, sheep, cattle (both beef and dairy) and goats, 
laboratory animals such as mice, rats, monkeys and rabbits, and wild animals 

30 in captivity such as elephants, lions, tigers and bears, and other mammals, fish 
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including, but not limited to, trout, salmon, carp and tuna, and birds, including, 
but not limited to, poultry such as chickens and turkeys, ducks and geese. 



Example 5 

Methods of Preparing a Discontinuous Library 
5 The methods of Examples 1-4 all describe methods of preparing and 

using complete pepscan display libraries. However, the libraries prepared 
according to Examples 1-4 all contain continuous pepscans. That is, the DNA 
fragments code for a continuous peptide, which represents a faithful primary 
conformation of a fragment of the antigen or epitope. However, as noted in 

10 the Background section above, many epitopes are discontinuous. In order to 
display these epitopes in a library, a different library, referred to herein as a 
discontinuous library, is required. 

A discontinuous library can be prepared as follows. First, a digest of 
the genetic material of a single biological unit or a portion thereof is prepared, 

15 by cutting at least a portion of the genome of an organism into fragments. This 
portion of the genome should represent at least a portion or part of a single 
biological unit of the organism. The single biological unit preferably has a 
discrete biological function. 

The digest of the genetic material is preferably substantially complete, 

20 such that DNA fragments initiate at substantially every possible base pair. 
Alternatively, it can be a partial digest, such that DNA fragments initiate at a 
subset of substantially every possible base pair. In either case, an enzyme can 
be used to perform the digest. For a substantially complete digest of the DNA, 
the enzyme DNAase I is often used. As noted for Example 1, DNAase I cuts 

25 DNA relatively randomly; that is, it cuts between any two bases with relatively 
equal efficiency. When DNAase I is incubated with DNA under appropriate 
conditions, a collection of fragments is obtained which initiate at substantially 
every single possible base pair. This collection of fragments is a substantially 
complete digest of the genome of the organism. 
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For a partial digest, enzymes with greater specificity can be used, under 
appropriate conditions for activity of the enzyme. For example, the enzyme 
EcoRI recognizes the following sequence: 

123456 

5 1 GAATTC 

2 CTTAAG 

EcoRI then cuts between G and A on strand 1 (bases 1 and 2), and between A 
and G on strand 2 (bases 5 and 6). Preferably, enzymes which cut more 
frequently, such as SauIIIA, are used, SauIIIA recognizes a four-base sequence 

10 and so is more likely to encounter the appropriate sequence on the DNA, 
Alternatively, a combination of such enzymes are used. DNAase I can also be 
used under appropriate conditions such that the DNA is only partially digested, 
for example by limiting the amount of time that DNAase I is allowed to cleave 
the DNA, or by reducing the concentration of DNAase I. 

15 Optionally, either a substantially complete or a partial digest could be 

prepared by mechanical shearing of the DNA. Such mechanical shearing is 
well known in the art, and can be accomplished by sonication of the DNA, for 
example. 

After the digest, either partial or substantially complete, has been 
20 prepared, the resulting fragments are processed so that these fragments have 
appropriately cut ends, by enzymatic degradation or synthesis for example. 
Those fragments which fall within a desired size range are then separated from 
the collection of fragments. At least two of the resulting DNA fragments are 
ligated together under appropriate conditions according to methods which are 
25 well known in the art. This ligation results in at least one ligation product, 
which is simply a single DNA segment formed by ligating at least two of the 
DNA fragments from the digest. Preferably, appropriate conditions are used 
for ligation so that more than two of the fragments ligate per ligation product. 
The ligation product is then digested again to form DNA fragments, 
30 which are then preferably sorted by size to separate those fragments which fall 
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within the desired size range. Preferably this size range is larger than the size 
range from the first digest. Similarly to the first digest, this second digest can 
either be partial or substantially complete, and can be performed according to 
a variety of methods, including but not limited to, enzymatic and mechanical. 
5 If a substantially complete digest is desired, DNAase I can be used, by 
incubating the ligation product with DNAase I under appropriate conditions, in 
order to obtain at least one DNA fragment. If a partial digest is desired, again 
restriction enzymes such as EcoRI, SauIIIA or even DNAase I can be used 
under the appropriate conditions, as described above. Alternatively, 

10 mechanical shearing can be used, by sonication of the ligation product for 
example. In any case, if desired, DNA fragments which fall within a desired 
size range can be separated from the rest of the digest. 

Each of such DNA fragments is a conformational fragment; that is, a 
fragment which is potentially capable of representing a discontinuous epitope. 

15 As noted above, a discontinuous epitope includes the novel contiguous 
presentation of two different amino acid sequences which were previously 
discontinuous in the original primary amino acid sequence. The discontinuous 
library includes the DNA fragment or fragments obtained from this second 
digestion, whether partial or substantially complete. Optionally, the 

20 discontinuous library can include a display carrier for the DNA fragment or 
fragments. Preferably, the display carrier includes one of several optional 
components and is prepared according to one of several preferred methods as 
described hereinbelow. 

Preferred Method 1 : One optional component is a plurality of bacteria, so that 
25 the DNA fragment or fragments are inserted into genetic material of the 
bacteria, according to Example 2. 

Preferred Method 2 : Another optional component is a plurality of phages, so 
that the DNA fragment or fragments are inserted into genetic material of the 
phages, according to Example 1. 
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Preferred Method 3 : Yet another optional component is a eukaryotic expression 
vector, so that the DNA fragments are inserted into the expression vector, 
according to Example 3. Optionally, the eukaryotic expression vector can be 
trans fected into eukaryotic cells, according to Example 3, 
5 The number of components within the display carrier which are required 

for a substantially complete representation of a single biological unit in a 
discontinuous library depend upon a number of factors, such as the type of 
display carrier, the number of DNA basepairs required to represent the single 
biological unit, the size of fragments desired from both digests, the extent of 

10 both digests and the extent of ligation. However, since the single biological 
unit typically requires fewer DNA basepairs for representation than a 
substantially complete genome, clearly a single biological unit could be 
accommodated by a single library. 

The advantage of the discontinuous library is that it can represent 

15 discontinuous epitopes, as noted above. However, in order to reduce the 
production of illogical or artifactual, biologically irrelevant "epitopes", the 
genetic material used to produce the library must represent at least a portion of 
a single biological unit with a biological function. Examples of such biological 
units include, but are not limited to, a protein, a group of proteins such as a 

20 cytoskeleton, and non-protein structures such as a tRNA, ribozymes or a 
telomere. 

The preparation of a discontinuous library for a protein or a portion of 
a protein would thus give information about dipeptide juxtapositions within that 
protein or portion of protein. These dipeptide juxtapositions arise during 

25 folding of the protein into its three dimensional structure, when a peptide from 
one location on the primary amino acid chain is folded next to another peptide 
from a completely different location on the primary amino acid chain. An 
epitope taken from the primary amino acid chain which faithfully represents the 
primary conformation of the protein could never represent such a dipeptide 

30 juxtaposition. 
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Similarly, a discontinuous library could also give information about 
dipeptide juxtapositions within a group of proteins or quaternary structure, such 
as a cytoskeleton. The cytoskeleton is composed of a number of different 
proteins, including tubulin and actin. Since tubulin and actin are completely 
5 different proteins, a single epitope from the primary amino acid chain of either 
protein could never represent a dipeptide juxtaposition of both proteins. 
However, a discontinuous library prepared from the genes encoding both 
proteins could represent such dipeptide juxtapositions. Indeed, a discontinuous 
library could even give information about transient interactions of two or more 

10 proteins, such as the binding of gpl20 to CD4, for example. 

Also, a discontinuous library could be used to represent non-protein 
structures such as a tRNA, ribozymes or a telomere. A tRNA and a ribozyme 
are RNA molecules folded into three dimensional structures, while a telomere 
is the DNA at the ends of a chromosome, which also assumes a particular 

15 structure. Since the discontinuous library includes genetic material, such as 
DNA or RNA, prepared according to the above method, clearly the structure 
of the telomere itself could be examined using the discontinuous library. The 
RNA could also be prepared by taking the DNA which codes for the tRNA, 
preparing a discontinuous library as described above, and then making RNA 

20 from the DNA, using in vitro transcription methods well known in the art. 

Example 6 

Methods of Preparing an Active Vaccine from a Discontinuous Library 
and Methods of Vaccinating an Organism with the Vaccine 

As noted above in Example 5, there are a number of methods of 

25 preparing a discontinuous library. Once such a library has been prepared, it 

has a number of potential uses, one of which is as a component of an active 

vaccine. 
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As described in the Background section above, an active vaccine causes 
at least one epitope of a first organism to be presented to the immune system 
of a second organism, which is the organism to be vaccinated. 

Thus, since the discontinuous hbraries described in preferred 
5 embodiments of Example 5 can cause a discontinuous or discontinuous epitope 
of an organism to be displayed, clearly these discontinuous libraries can be 
used to cause immunogenic epitopes to be displayed. An active vaccine can 
therefore be prepared as follows. 

First, a discontinuous library is prepared as described in Example 5, 

10 preferably with a display carrier. The first organism, whose genome is used 
for the discontinuous library, can include, but is not limited to, viruses 
including but not limited to retrovirus species such as HIV and HTLV, hepatitis 
species such as Hepatitis A and Hepatitis B, influenza species, human 
papillomavirus, herpes species such as herpes simplex, RSV (respiratory 

15 syncytial virus) and cytomegalovims; bacteria including, but not limited to, 
Mycobacterium tuberculosis , salmonella, staphylococcus species such as Staph. 
aureus, and shigella; parasites including, but not limited to, Plasmodium species 
such as Plasmodium falciparum , leishmania species such as Leishmania major 
and L. braziliensis braziliensis , entamoeba species, giardia species, trichomonas 

20 species and trypanosoma species; and yeasts including, but not limited to, 
Candida albicans . 

Next, the discontinuous library is placed in a vaccine carrier. The 
vaccine carrier is a pharmaceutical composition, which may include thickeners, 
carriers, buffers, diluents, surface active agents, preservatives, and the like, all 

25 as well known in the art. Pharmaceutical compositions may also include one 
or more active ingredients including, but not limited to, immune-system 
stimulating agents and the like in addition to the discontinuous library. For 
example, the vaccine carrier can include pharmaceutically appropriate buffers. 
Alternatively, if a discontinuous library is prepared according to Preferred 

30 Method 1 of Example 5, the vaccine carrier can include a plurality of phages. 
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preferably filamentous phages. Most preferably, the DNA fragments are 
inserted into a gene for a coat protein of the phage, optionally pIII or pVIII. 

Optionally, if a discontinuous library is prepared according to Preferred 
Method 2 of Example 5, the DNA fragments are inserted into genetic material 
5 of the bacteria, so that the complete pepscan can be produced by a plurality of 
bacteria which synthesize the peptides of the pepscan as part of a conjugate to 
a bacterial protein. Since bacteria themselves are both highly immunogenic, 
' as well as potentially hazardous to administer directly to the organism to be 
immunized, substantially only the conjugates should be included in the vaccine 
10 carrier. These conjugates can be allowed to accumulate in the bacterial cell, 
but preferably include a secreted bacterial protein so that the conjugate is 
secreted by the bacteria, to facilitate collection of the conjugate from the 
growth media. 

Also optionally, if a discontinuous library is prepared according to 

15 Preferred Method 3 of Example 5, the vaccine carrier can include the 
eukaryotic expression vector itself. Preferred Method 3 of Example 5 can be 
somewhat modified, in the following manner. Instead of transfecting 
eukaryotic cells with the eukaryotic expression vector containing the inserted 
DNA fragment, the eukaryotic expression vector itself can be placed in the 

20 vaccine carrier. The combination of the eukaryotic expression vector and the 
vaccine carrier is an example of a "naked DNA vaccine", as described in the 
Background above. 

Once the vaccine has been prepared according to one of the above 
methods, it can be administered to the organism to be vaccinated in a number 

25 of ways, which are well known in the art. For example, administration may 
be done topically (including ophtalmically, vaginally, rectally, intranasally), 
orally, by inhalation, or parenterally, for example by intravenous drip or 
intraperitoneal, subcutaneous, or intramuscular injection. 

Formulations for topical administration may include but are not limited 

30 to lotions, ointments, gels, creams, suppositories, drops, liquids, sprays and 
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powders. Conventional pharmaceutical carriers, aqueous, powder or oily 
bases, thickeners and the like may be necessary or desirable. 

Compositions for oral administration include powders or granules, 
suspensions or solutions in water or non-aqueous media, sachets, capsules or 
5 tablets. Thickeners, diluents, flavorings, dispersing aids, emulsifiers or binders 
may be desirable. 

Formulations for parenteral administration may include but are not 
limited to sterile aqueous solutions which may also contain buffers, diluents and 
other suitable additives. 

10 Dosing is dependent on responsiveness of the immune system of the 

organism to be vaccinated, but will normally be include an initial dose of the 
vaccine. If necessary a booster dose or doses can administered at a later date 
to achieve a desired level of protection against the first organism, as measured 
by antibody titer, for example. Persons of ordinary skill in the art can easily 

15 determine optimum dosages, dosing methodologies and repetition rates. 

Examples of organisms to which such a vaccine could be administered 
include, but are not limited to, humans, pets such as dogs and cats, farm 
animals such as horses, pigs, sheep, cattle (both beef and dairy) and goats, 
laboratory animals such as mice, rats, monkeys and rabbits, and wild animals 

20 in captivity such as elephants, lions, tigers and bears, and other mammals, fish 
such as trout, salmon, carp and tuna, and birds such as poultry, including 
chickens and turkeys, ducks and geese. 

Example 7 

Methods of Preparing a Conformational Peptide 
25 A conformational peptide is a peptide which represents a discontinuous 

epitope, as described in the Background section above. Such a peptide can be 
prepared by first preparing a discontinuous library, according to the method of 
Example 5. Second, the discontinuous library is inserted into an expression 
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system, which synthesizes the peptide. Examples of such an expression system 
include, but are not limited to, viral, bacterial, eukaryotic, and artificial. 

A phage display system can be prepared as follows. First, the 
discontinuous library can be cloned into the phage. Since the discontinuous 
5 library of this embodiment is actually at least one DNA fragment, at least one 
phage is prepared to receive the at least one fragment. This is accomplished 
by modifying a gene of the phage, so that the fragment can be cloned into the 
phage gene. The product of that gene, a phage protein, will then include a 
foreign peptide. The sequence of the foreign peptide will depend upon the 

10 inserted DNA fragment. Preferably, a gene for a coat protein is used, so that 
the phage will display the foreign peptide on its outer surface. Typically, 
filamentous phages are used because they have coat proteins which are easily 
modified to receive the DNA fragment. The preferred coat proteins are pIII 
and pVIII. pIII can tolerate insertions of foreign peptides of up to a few 

15 hundred amino acids in length, but only five copies of pIII are on the phage 
coat. About 2700 copies of pVIII are on the phage coat, but pVIII can only 
tolerate insertions of five or six amino acids, unless special rescue methods are 
used. The choice of a particular coat protein depends upon both the length of 
the foreign peptide and upon the desired number of copies of the foreign 

20 peptide. 

Once the particular coat protein has been chosen, the gene is modified 
to contain a unique blunt end restriction site, preferably at its N terminal 
region, using methods which are well known in the art. The particular nature 
of the restriction site depends upon the strain of phage which is used. For 
25 example, for the pIII construct of the fUSE5 vector of the filamentous phage 
fd-tet such a site could be inserted between the two Sfi-1 sites present on the 
vector. However, it is possible to construct many such blunt-ended phage 
vectors, by manipulating restriction sites so that DNA fragments can be 
inserted where desired. Alternatively, oligonucleotide linkers can be used to 
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clone the DNA fragments in distinct and unique restriction sites in the vector 
which are not necessarily blunt-ended. 

Finally, the fragment or fragments from the digest of the ligation product 
is cloned into the appropriately cut phage vector, using methods v/hich are vvell 
5 knov^n in the art. 

A second system is a bacterial expression system. First, the 
discontinuous library can be cloned into the vector, such as a phage, a plasmid, 
a phagmid or a cosmid. Such vectors are v^ell know^n in the art. Although a 
phage can act as a vector, the phage itself does not display the peptide on its 
10 outer coat. Similarly, the other vectors are used only to introduce DNA 
fragments into the bacteria, but are not themselves used to display the foreign 
peptides . Since the discontinuous library of this embodiment is actually at least 
one DNA fragment, at least one vector is prepared to receive the at least one 
fragment. 

15 Finally, the vector, with the inserted fragment from the digest, is 

transfected into the bacterial strain of choice. Again, transfection procedures 
are well known in the art. These bacteria then produce the peptide coded for 
by the fragment from the digest. These peptides can accumulate within the 
celL Alternatively, they can be displayed on the cell wall of the bacteria or 

20 secreted into the bacterial growth media, from which they can be collected. 

A third expression system is a eukaryotic expression system, which is 
prepared as follows. The fragment or fragments from the second digest are 
cloned into a eukaryotic expression vector. Both such vectors and such 
methods are well known in the art. For example, commercially available 

25 vectors include pSVK 3 and pBPV, both available from Pharmacia Biotech. 
Next, the eukaryotic expression vector, with the inserted fragment from the 
digest, is transfected into an appropriate eukaryotic system. Again, transfection 
procedures are well known in the art. These eukaryotic cells then produce the 
peptide coded for by the fragment from the digest. These peptides can 

30 accumulate within the cell. Alternatively, they can be displayed on the cell 
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wall of the eukaryote or secreted into the eukary otic growth media, from which 
they can be collected. 

Examples of such eukaryotic systems include yeast and mammalian cell 
lines, such as Cos-1, which is an immortal cell line. However, such 
5 transfections do not need to be performed only on isolated cells. For example, 
the eukaryotic expression vector, with the inserted fragment, could be 
introduced into cells of a whole eukaryote, without first removing those cells 
from the eukaryote. For example, the expression vector and inserted fragment 
could be introduced into the macrophages of a horse, or of a human, such that 
10 those macrophages would then produce the peptide coded for by the inserted 
DNA fragment. 

Furthermore, the continuous peptide itself has a number of uses. For 
example, the continuous peptide could be used to block aligand binding to a 
receptor, such as gpI20 binding to CD4. Alternatively, the peptide could act 
15 a molecular decoy, by binding a protein or peptide. 

Example 8 

Methods of Synthesizing and of Detecting an Antibody 
for Binding a Discontinuous Epitope 

As noted above in the Background section, discontinuous and continuous 

20 epitopes of a first organism can be bound by antibodies which are produced by 

the immune system of a second organism, also referred to as the organism to 

be vaccinated. Methods of detecting antibodies which bind continuous epitopes 

are well known in the art, and generally involve screening immune material 

which contains at least one antibody. 

25 The immune material which is screened can be prepared by 

administering a vaccination entity to the second organism. Optionally, the 

vaccination entity can include at least a portion of the single biological unit of 

the first organism. Alternatively, the vaccination entity can include a 

discontinuous library. The discontinuous library can be prepared according to 
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Example 5, using Preferred Method 1, 2 or 3, Each of these methods results 
in the preparation of a discontinuous library which is presented by a display 
carrier, which can also be an expression carrier. The display carrier can be-a 
phage display library, bacterial expression library or an eukaryotic expression 
5 library, depending upon the method which is used. 

Optionally, the immune material which is screened can include serum 
with at least one antibody. For example, such serum could be obtained from 
a human patient who has been infected by, or immunized against, the first 
organism. Alternatively, such serum could be obtained from an animal, 

10 including, but not limited to, a rabbit or a mouse, which has been infected by, 
or immunized against, the first organism. 

Also optionally, the immune material can include polyclonal antibodies. 
Polyclonal antibodies can be prepared by injecting a rabbit or an animal with 
a larger blood volume such as a horse, for example, with a substance or 

15 substances which provoke an immune response. Such substances can include, 
but are not limited to, substantially the entire first organism, or whole or partial 
proteins from the first organism. The rabbit or horse then produces polyclonal 
antibodies in response to these substances. Such antibodies are called 
"polyclonal" because they are derived from more than one B-cell clone. 

20 Monoclonal antibodies, on the other hand, bind to only one epitope of 

the pathogen. Monoclonal antibodies are obtained from hybridomas, which are 
produced by fusing spleen-derived B cells which secrete a single antibody, with 
non-antibody-secreting myeloma cells. Such cells can be prepared from mice 
cells, for example, or by immortalization of human B-cells by EBV (Epstein- 

25 Barr virus), for example. 

Once the immune material has been prepared, similar methods to those 
well known in the art can be used to detect antibodies which bind discontinuous 
epitopes of a single biological unit of the first organism. As a first step, a 
screening entity is prepared. This screening-entity can include a discontinuous 

30 library which is prepared according to Example 5, using Preferred Method 1, 
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library which is presented by a display carrier. The display carrier can be a 
phage display library, a bacterial expression library or an eukaryotic expression 
library, depending upon the method which is used. Alternatively, the screening 
5 entity can include at least a portion of the single biological unit which contains 
the discontinuous epitope of interest. However, the screening entity should not 
be identical to the vaccination entity. 

After the screening entity has been chosen, it is screened with immune 
material which contains at least one antibody. Such screening can be 

10 performed in a number of different ways which are well known in the art, and 
some of which are described in Molecular Biology of the Cell . 

For example, the screening entity can be substantially immobilized on 
a solid support. Examples of such solid supports include, but are not limited 
to, porous membranes such as nylon and nitrocellulose, non-porous films such 

15 as polypropylene, or plastics. The immune material can then be incubated with 
the screening entity under appropriate conditions, so that any antibody or 
antibodies can bind to the appropriate epitope or epitopes, as presented by the 
screening library [Smith, G*P. and J.K. Scott, Methods in EnzymoL, 1993, 
217:228-2571. 

20 Finally, the presence of the bound antibody or antibodies can be detected 

in a variety of ways, which are well known in the art. For example, the 
antibody or antibodies can be directly labelled with a substance which acts as 
a marker for the presence of the antibody or antibodies. Examples of such 
substances include, but are not limited to, fluorescent dyes or radioactive 

25 substances. Alternatively, greater sensitivity can be obtained by using a 
secondary antibody which can bind to a variety of antibodies. The secondary 
antibody can be labelled as above. Optionally, an ELISA (enzyme-linked 
immunoassay) can be used, in which an enzyme, such as alkaline phosphatase, 
is lilted to the secondary antibody. A substrate is incubated with the enzyme. 
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and a sensitive chemical test is used to detect the presence of the enzymatic 
product. 

Once the antibody has been detected, the antibody can then be obtained 
from the immune material, by a number of methods which are well known in 
5 the art, for example, protein purification methods including, but not limited to, 
cliromatography. 

Example 9 
Methods of Preparing a Passive Vaccine 
and Methods of Vaccinating an Organism with the Vaccine 

10 A passive vaccine is one which already contains an antibody or 

antibodies against a particular epitope or epitopes, as opposed to an active 
vaccine, which attempts to stimulate the immune system to produce such 
antibodies. Passive vaccines can be prepared against discontinuous epitopes 
according to the following procedure. First, at least one antibody which binds 

15 at least one discontinuous epitope is detected, as described above in Example 
8. Such an antibody can also be synthesized according to the methods given 
in Example 8. 

Next, the antibody is placed in a vaccine carrier, according to Example 
4. Examples of organisms to which such a vaccine could be administered 
20 include, but are not limited to, those given in Example 4. 

Example 10 
A Diagnostic Tool for Detecting an Organism 
by Using a Conformational Unit 

Frequently it is desirable to diagnose the presence of an immune 

25 response to a particular organism in a tissue or blood sample, or even in vivo . 

Such detection can be accomplished by using a conformational peptide which 

represents an epitope of the organism of interest, putting the immune material 

which includes at least one antibody, such as the blood or tissue sample, in 

contact with the conformational peptide, and then by using a detection assay to 
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detect antibody binding to the conformational peptide. A diagnostic tool which 
can perform such a diagnosis has many potential biomedical uses, including the 
detection of organisms including, but not limited to viruses including but not 
limited to retrovirus species such as HIV and HTLV, hepatitis species such as 
5 Hepatitis A and Hepatitis B, influenza species, human papillomavirus, herpes 
species such as herpes simplex, RSV (respiratory syncytial virus) and 
cytomegalovirus; bacteria including, but not limited to, Mycobacterium 
tuberculosis , salmonella, staphylococcus species such as Staph, aureus , and 
shigella; parasites including, but not limited to, Plasmodium species such as 

10 Plasmodium falciparum , leishmania species such as Leishmania major and L. 
braziliensis braziliensis , entamoeba species, giardia species, trichomonas 
species and trypanosoma species; and yeasts including, but not limited to, 
Candida albicans . Certainly, such diagnostic tools are known for detecting the 
presence of HIV for example. However, these tools generally do not rely upon 

15 antibodies which have been specifically prepared to bind discontinuous 
epitopes. 

A novel diagnostic tool can be constructed from a conformational peptide 
representing at least one discontinuous epitope of a single biological unit of the 
organism of interest, and a detection assay for determining when an antibody 

20 is actually bound to the discontinuous epitope. It should be noted that although 
this description uses a "conformational peptide" , it could be extended to include 
a discontinuous library as described above in Example 5. Both a 
conformational peptide and a discontinuous library are hereinafter collectively 
known as a "conformational unit". The conformational peptide is prepared 

25 according to one of the methods of Example 7. These detection assays can 
employ a detection moiety attached to the conformational peptide, a density 
gradient, or a chromatograph. Such a detection moiety can include fluorescent 
dyes such as rhodamine or fluorescein. These dyes can be detected by a 
spectrophotometer, for example. 
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Density gradients are well known in the art, and are prepared in the 
following manner. First, a density gradient is prepared by layering materials 
of different densities within a container, such as a tube. The material with the 
highest density is substantially near the bottom of the container, while the 
5 material with the lowest density is substantially near the top of the container. 
An example of such a material is a sucrose solution with varying concentrations 
of sucrose. The higher the concentration of sucrose, the higher the density of 
the solution. 

After the density gradient has been prepared, a solution containing the 
1 0 conformational peptide and antibody or antibodies of interest is placed on top 
of the material with the lowest density, being careful not to disturb the 
gradient. The container with the density gradient is then subjected to 
ultracentrifugation. A conformational peptide which has been bound by an 
antibody will have a different density, and hence will move to a different layer, 
1 5 as compared to a conformational peptide which has not been bound by such an 
antibody. Therefore, each will migrate to a different position along the 
gradient. After ultracentrifugation, the location of the conformational peptide 
within the gradient can be detected according, to methods which are well known 
in the art. 

20 Yet another example of a detection assay employs a chromatograph. A 

chromatograph is a device for separating proteins according to some property. 
For example, proteins can be separated according to their relative solubility by 
thin-layer chromatography. In this procedure, a sample which includes proteins 
in solution is applied to a thin layer of absorbent material, such as cellulose or 

25 silica gel, which has been attached to a sheet of stiff material such as plastic or 
glass. At least one solvent is introduced to one edge of the absorbent material. 
As the solvent front moves through the absorbent material, the proteins are 
separated according to their relative solubility in the solvent. Alternatively, 
proteins can be separated by column chromatography, in which a sample which 

30 includes proteins in solution is applied to one end of a column containing a 
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porous solid. Such a porous solid then separates the proteins by size, or by 
some other property. Clearly, any of these chromatographs could be used to 
separate a conformational peptide which is bound by an antibody of interest 
from a conformational peptide which is not so bound. 
5 Thus, this diagnostic tool could be used as follows. First, the 

conformational unit, either conformational peptide or discontinuous library, is 
incubated with a sample containing an antibody. A sample is simply defined 
as containing an antibody, including, but not limited to, a portion of tissue or 
blood and immune material. "Incubated" is simply defined as allowing the 

10 sample to come in contact with the conformational unit under conditions which 
are appropriate for permitting the binding of the antibody to the conformational 
unit. For example, an appropriate buffer or buffers could be included, and the 
sample and conformational unit could be maintained at an appropriate 
temperature. Next, the detection assay, as described above, could be 

15 performed to determine when the conformational unit is bound by the antibody. 

Example 11 

A Diagnostic Tool for Detecting an Organism with an Antibody 
Frequently it is desirable to diagnose the presence of a particular 
organism in a tissue or blood sample, or even in vivo . Such detection can be 

20 accomplished by using an antibody or antibodies which bind to at least one 
epitope of the organism of interest, and then by using a detection assay to 
detect antibody binding. A diagnostic tool which can perform such a diagnosis 
has many potential biomedical uses, including the detection of organisms 
including, but not limited to, viruses including but not limited to retrovirus 

25 species such as HIV and HTLV, hepatitis species such as Hepatitis A and 
Hepatitis B, influenza species, human papillomavirus, herpes species such as 
herpes simplex, RSV (respiratory syncytial virus) and cytomegalovirus; bacteria 
including, but not limited to, Mycobacterium tuberculosis , salmonella, 
staphylococcus species such as Staph, aureus, and shigella; parasites including. 



wo 98/20169 



PCT/IL97/00353 



47 

but not limited to, Plasmodium species such as Plasmodium falciparum , 
leishmania species such as Leishmania major and L. braziliensis braziliensis , 
entamoeba species, giardia species, trichomonas species and trypanosoma 
species; and yeasts including, but not limited to, Candida albicans . Certainly, 
5 such diagnostic tools are known for detecting the presence of HIV for example. 
However, these tools generally do not rely upon antibodies which have been 
specifically prepared to bind discontinuous epitopes. 

A novel diagnostic tool can be constructed from an antibody for binding 
at least one discontinuous epitope of a single biological unit of the organism of 

10 interest, and a detection assay for determining when the antibody is actually 
bound to the discontinuous epitope. The antibody is prepared according to 
Example 8 . These detection assays can employ a detection moiety attached to 
the antibody, a density gradient, or a chromatograph. Such a detection moiety 
can include fluorescent dyes such as rhodamine or fluorescein. These dyes can 

15 be detected by a spectrophotometer, for example. 

Density gradients are well known in the art, and are prepared in the 
following manner. First, a density gradient is prepared by layering materials 
of different densities within a container, such as a tube. The material with the 
highest density is substantially near the bottom of the container, while the 

20 material with the lowest density is substantially near the top of the container. 
An example of such a material is a sucrose solution with varying concentrations 
of sucrose. The higher the concentration of sucrose, the higher the density of 
the solution. 

After the density gradient has been prepared, a solution containing the 
25 antibody or antibodies of interest is placed on top of the material with the 
lowest density, being careful not to disturb the gradient. The container with 
the density gradient is then subjected to ultracentrifugation. An antibody which 
has bound a discontinuous epitope will have a different density, and hence will 
move to a different layer, as compared to an antibody which has not bound 
30 such an epitope. Therefore, each will migrate to a different position along the 
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gradient. After ultracentrifugation, the location of the antibody within the 
gradient can be detected according to methods which are well known in the art. 

Yet another example of a detection assay employs a chromatograph. Ar 
chromatograph is a device for separating proteins according to some property. 
5 For example, proteins can be separated according to their relative solubility by 
thin-layer chromatography. In this procedure, a sample which includes proteins 
in solution is applied to a thin layer of absorbent material, such as cellulose or 
silica gel, which has been attached to a sheet of stiff material such as plastic or 
glass. At least one solvent is introduced to one edge of the absorbent material. 

10 As the solvent front moves through the absorbent material, the proteins are 
separated according to their relative solubility in the solvent. Alternatively, 
proteins can be separated by column chromatography, in which a sample which 
includes proteins in solution is applied to one end of a column containing a 
porous solid. Such a porous solid then separates the proteins by size, or by 

15 some other property. Clearly, any of these chromatographs could be used to 
separate an antibody which is bound to a discontinuous epitope of interest from 
an antibody which is not so bound. 

This diagnostic tool could be used as follows. First, the antibody could 
be incubated with a sample containing at least one discontinuous epitope of the 

20 first organism. A sample is simply defined as containing at least one 
discontinuous epitope of the organism, including, but not limited to, a portion 
of tissue or blood and immune material. "Incubated" is simply defined as 
allowing the sample to come in contact with the at least one discontinuous 
epitope under conditions which are appropriate for permitting the binding of the 

25 antibody to the at least one discontinuous epitope. For example, an appropriate 
buffer or buffers could be included, and the sample and antibody could be 
maintained at an appropriate temperature. Next, the detection assay is used for 
determining when the antibody is bound to ihe at least one discontinuous 
epitope of the organism. 
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Example 12 

Method of Determining a Structure of a Protein with an Identified Gene 
and of Preparing a Filter tbr Such a Structure Determination 

As noted in the Background above, the three-dimensional, or tertiary, 

5 structure of a protein can provide useful information which can be exploited for 

developing new pharmaceuticals such as vaccines or drugs, and for many other 

purposes. However, determining such a protein structure can be difficult. 

Methods are known in the art which attempt to determine a protein structure 

from the primary amino acid sequence, but these methods are not usually very 

10 successful, simply because the "rules" for protein folding are not completely 
understood. More successful methods place constraints on such protein 
structure determinations, by obtaining information about the protein itself. 

Such information can be obtained by NMR, which provides information 
about tlie interactions of atoms within the protein, as noted in the Background 

15 section above. However, NMR suffers from lack of specificity; that is, the 
interactions of too many atoms are all presented simultaneously, making it 
difficult to decipher the behavior of individual atoms. Furthermore, NMR 
requires large amounts of highly pure protein. A more specific method, using 
the interaction of fluorescent dye molecules attached to particular residues, was 

20 also described in the Background section above. However, this method 
requires that the protein be purified and labelled with dye molecules, which are 
non-trivial procedures. 

A better method for determining specific interactions between parts of 
the protein, and hence for obtaining constraints in the form of a partial 

25 structure which can be used to determine the full protein structure, requires 
only that the protein has an identified gene. By "identified*', it is meant that 
the gene is available and that the sequence of the gene is known. This method 
is as follows. 

First, a conformational peptide of the protein of interest is prepared 
30 according to Example 7. Second, the conformational peptide is screened with 
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a molecule. This molecule is characterized by having a known interaction with 
the protein of interest. By screening the conformational peptide with the 
molecule, a conformational peptide which represents a true dipeptide 
juxtaposition of the protein can be obtained. Methods of screening can include 
5 any method which permits detection of the binding of the molecule of interest 
to the protein itself, simply by substituting the conformational peptide for the 
protein in the assay . 

Next, the amino acid sequence of the conformational peptide is 
determined. One method of determining the amino acid sequence is to 

10 determine the DNA sequence of the conformational fragment which codes for 
the conformational peptide. Alternatively, the conformational peptide can be 
directly sequenced, by hand or by machine, according to methods which are 
well known in the art. This sequence can be used to determine a dipeptide 
Juxtaposition; that is, the two peptides of the protein which form a 

1 5 discontinuous epitope due to their relative locations in the folded protein. Such 
a dipeptide juxtaposition actually represents a partial structure of the protein, 
since it gives information about the relative location of two parts of the protein 
within the three-dimensional structure. 

This partial structure can be used as a basis for deducing the 

20 substantially complete three-dimensional structure of the protein. For example, 
the partial structure can be used as a constraint, in order to limit the number 
of theoretical structures which must be examined. Such a constraint is similar 
to that derived from the interaction of fluorescent dye molecules attached to 
particular residues, as described in the Background above. Thus, clearly the 

25 general use of such constraints is well known in the art. However, the specific 
use of a dipeptide juxtaposition is clearly novel and non-obvious. 

Furthermore, such a dipeptide juxtaposition can clearly be used as part 
of a filter for determining if a theoretical structure of a protein is non- 
biological. Such a filter would include a dipeptide juxtaposition of a protein 

30 as described above, and an algorithm for comparing the dipeptide juxtaposition 
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to the theoretical structure, in order to determine if the theoretical structure is 
non-biological. Such algorithms are well known in the art, as described above 
in the Background, and include commercially available algorithms such as 
Delphi and Charmm. Further references include Havel, T.F. and KJ. 
5 Wutrich, J. Mol. Bioh, 1985, 182:281-294. 

While the invention has been described with respect to a limited number 
of embodiments, it will be appreciated that many variations, modifications and 
other applications of the invention may be made. 



